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1 Introduction 



Experiments and historical data analysis with the spatial model of voting 
call for underpinning theoretical work. Two categories of results needed 
to support empirical research, that have only recently begun to receive 
attention, are computational methods and statistical convergence. 

This paper surveys results of these types, contributes some new results, 
and states open problems for further research. We can attempt to give 
a fairly complete picture of the current status of this research area, on 
account of its newness. We hope that the work surveyed here will aid in 
the empirical study of the spatial model, and stimulate additional advances 
and applications. In the remainder of the introduction we explain the 
motivation for each of the two categories of results, and state the principal 
new results given here. 

Computational methods. A typical scenario that requires a computational 
method arises as follows: a researcher has numerical data giving the loca- 
tions of voter ideal points and of the resulting group choice. The researcher 
wants to determine whether the outcome lies in a particular solution set, 
to test the predictive power of the solution concept. This requires that the 
solution set be computed, with respect to the numerical data. 

If the number of voters is small this computation can ordinarily be 
performed by hand. But as the number of voters grows, the difficulty of 
the computation may increase enormously. 

We temporarily depart from our scenario to introduce informally some 
notions of computational complexity. In the field of computer science, com- 
putational requirements are generally measured as a function of the size of 
the problem (in this case, the number of voters) (see 15]). Larger size 
problems require more computer time: the key question is, at what rate 
of increase? Roughly speaking, an algorithm, or computational procedure, 
is considered “fast” if it requires computer time growing as a polynomial 
function of the problem size. An algorithm is considered “slow” if it re- 
quires time growing as an exponential function of the problem size. For 
example, a procedure that examined all pairs of voters would require time 
quadratic in problem size, and would be fast; a procedure that examined 
all subsets of voters would require about 2" time, where n = the number of 
voters, and would be slow. There is a corresponding taxonomy of problems: 
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a problem is considered “easy” if there is a fast algorithm known to solve 
it; a problem is considered “hard” if no fast algorithm is known that solves 
it. The “NP-complete” problems are a prominent class of “hard” problems. 

Returning to our scenario, we have the unfortunate situation that almost 
any solution concept worth its salt is “hard”. This is a consequence of 
an observation and a theorem. The observation is, most good solution 
concepts coincide precisely with the Condorcet winner, when the latter 
exists. The theorem is, it is hard to determine if there is a Condorcet 
winner [ l] . Therefore, most good solution concepts implicitly check if a 
Condorcet winner exists, and so computing them must be hard as well. 

This does not mean that the researcher in our scenario is doomed to 
failure. But this does mean that our scenario demands genuine expertise 
in computational methods. As researchers begin to work with data from 
committees comprised of more than a handful of voters, (e.g. legislative 
assemblies), we must pay some serious attention to computational meth- 
ods to support this work. Good algorithmic tools are needed to compute 
win sets and critical levels for various solution concepts, and to fit spatial 
locations to voting data 1 . 

The principal new results in the area of computational methods are: 
NP-completeness of one-dimensional recovery of spatial locations from his- 
torical voting data; improvements in heuristics for multidimensional spatial 
location recovery; fast algorithms to compute Simpson-Kramer points and 
supermajority win sets in fixed (low) dimension; fast algorithms to compute 
membership and critical levels of epsilon-cores, in 2 dimensions. 

Statistical consistency . 

Empirical studies with the spatial model will almost inevitably involve 
some degree of randomness, due to variations in individual behavior, im- 
precision in measurements, use of sample data (e.g. survey or poll data), 
and other uncertainties of information. As discussed in [29], we would like 

1 Fast algorithmic tools are a practical possibility, despite the theoretical “hardness” 
just explained. The whole field of heuristics in operations research is devoted to practical 
means of overcoming computational difficulty. One reason this is possible is that the 
taxonomy given is “worst-case” - many or most numerical cases of a problem may be easy 
to solve, even if the problem is hard. For spatial data in particular, the theorem cited 
( [ l] ) is true only when the dimension is permitted to be large. If the dimension is fixed at 
a small level (e.g. 2 in most current empirical studies) the problem, though not trivial, is 
technically “easy”. 
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to extract data on the ideal points of committee members or a population, 
and make a prediction regarding the outcome based on a solution concept. 
Can we be confident that a prediction based on polls taken one day will be 
close to the actual results the next day, given the random factors mentioned 
(individual variability, incomplete survey information, etc.)? 

One approach to this problem is to think of the population’s views as 
having a probability distribution. When a person responds to a survey or 
votes, it is on the basis of a random sample from this distribution. The 
problem is then to establish the stability of a solution concept under these 
conditions. 

In the language of probability and statistics, a finite sample of n points 
from a probability distribution n corresponds to an empirical measure fi n . 
This measure puts mass 1/n at each of the n sample points. A solution 
concept is a function / operating on probability measures, mapping to sets 
in If we could establish that 

Jim f(nn) -► f{n) a.e., 

then the sample statistic f{^ n ) would be an asymptotically consistent esti- 
mator for and we would be confident of the limiting behavior of the 

sample solution concept. 

Thus the second category of results consists of proofs of consistency of 
sample estimators for various solution concepts. The new results include 
the consistency of the sample win set under supermajority voting, the yolk 
center, and the epsilon-core and critical level. 



2 Recovery of spatial locations 

In this section we address the problem of “recovering” spatial locations from 
voting data. From a different point of view, we can define the problem as 
that of fitting the spatial model to data. The idea is, we have data such 
as roll call votes on how each member of a population voted on a set of 
issues. We want to determine locations for the voter ideal points, or issue 
locations, or both, that best fit the data. 

Let us be precise. In the recovery problem, we are given the follow- 
ing data: a list of voters indexed a list of proposals a 

dimension d\ voting data in the form of an n by m matrix A, where 
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• A t] = 1 if voter i voted for proposal j 

• Aij = — 1 if voter i voted against proposal j 

• Aij = 0 if voter i did not vote on j or the information is not available. 

The recovery problem is to specify locations in SR 11 for the voter ideal 
points and for the “yea” and “nay” of the proposals, that best fit the 
data A^ according to some criterion. The simplest criterion would be 
to minimize the number of “errors,” i.e., conflicts between the data and 
the model predictions. When this criterion is used the problem will be 
referred to as the simple recovery problem. More sophisticated weighted 
error measures are also possible. For example, one could base the measure 
on a probabilistic model of individual choice as in [9,8’, (see [22,21]). 

Some researchers locate ideal points by an analysis of the content of 
the proposals [16, e.g.]. This method has both potential advantages (use of 
expert knowledge, predictive power) and drawbacks (judgement-based, not 
easily replicable). The recovery problem defined here may not arise when 
this method is used. A constrained version of the recovery problem however 
might arise, if proposal content were used to locate the proposals, and then 
voting data were used to recover the voter ideal points. This constrained 
version can be solved optimally by methods presented later in this section. 
(The content-based method also seems related to the recovery problem in 
that the study of good solutions to the recovery problem could help clarify 
the understanding of content, see 23] for example.) 

The principal results of this section are: 

• It is XP-complete (i.e. computationally difficult in the worst case) to 
solve the simple recovery problem, even in one dimension. 

• In low dimension (e.g. 3 or less) there are computationally efficient 
algorithms to optimally recover ideal points given fixed proposal lo- 
cations, and vice-versa (recover proposal locations given fixed ideal 
points), for the simple recovery problem. Improved heuristic algo- 
rithms can then be developed for both simple and general recovery in 
low dimension. 

The complexity result provides theoretical justification for employing 
heuristic methods to solve the recovery problem, such as the alternating 
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heuristic of Poole and Rosenthal [22]. The algorithms here lead to some 
refinements of this heuristic. 



2.1 Complexity of the simple recovery problem 

Surprisingly, the simple recovery problem is not simple, even in one dimen- 
sion. 

Theorem 1: The simple recovery problem is NP-complete m one dimen- 
sion. If the orientation of each proposal is fixed, the problem remains NP- 
complete. 

Proof: The reduction is from feedback arc set. Given am arbitrary directed 
graph G, insert a vertex into each arc. Each arc becomes a path of length 
2 and the modified graph G is bipartite. Associate a voter with each node 
in the first part of G and a proposal with each node in the second part. An 
arc from a voter i to a proposal j will mean a yes vote, A,- ; = 1; an arc from 
proposal j to voter i will mean a no vote, A t} - = — 1; otherwise A,j = 0. Pad 
the voter set with K “yes” voters, who vote in favor of every proposal, and 
an equal number of “no” voters, who vote against every proposal. For large 
K these extra voters force the orientation of each proposal. In particular, 
K is large enough if K = the number of arcs in G. Without loss suppose 
the yes voters will be placed at the bottom of the ordering and the no voters 
will be at the top. (This padding ensures that the problem remains hard 
even if the proposal orientations are fixed.) Then arranging the voter ideal 
points and proposal centers to minimize the number of errors is equivalent 
to arranging the vertices of G to minimize the number of downward pointing 
arcs. This is the feedback arc set problem on G. 

The last step of the proof is to show this is also the feedback arc set 
problem on G. Now, minimizing the number of downward pointing arcs is 
equivalent to determining a minimum cardinality subset S of arcs in G such 
that every directed cycle in G contains at least one member of 5. Similarly 
let S denote the minimum feedback arc set of graph G. Suppose arc (u,v) 
in G corresponds to arcs (u,j),(j, v) in G, t'.e., j was the vertex inserted 
into arc (u,t>). The two arcs ( u,j ) and (j, v) intersect the same cycles in 
G, thus the minimum cardinality S does not contain both. Hence there 
exists a 1-1 correspondence between members of S and members of 5, and 
minimizing |5| minimize |S| as well. 
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Finally we observe that the reduction is legitimate, because the matrix 
A has size polynomial in |G|. □ 

The reduction shows that the recovery problem is similar to the prob- 
lem of determining the Kemeny consensus, and is hard for related reasons 
[2]. The key similarity is this: in both cases we are attempting to sum- 
marize highly multidimensional information from many voters into a single 
permutation, and we measure the quality of the summary by counting the 
disagreements between it and the original multivoter data. Intuitively, this 
is why both NP-completeness reductions are from the feedback arc set prob- 
lem, where arcs pointing the wrong way are the disagreements. Of course 
all NP-complete problems are equivalent in a formal sense; the similarity 
described here is meant to be illuminating, though informal. 

2.2 Optimal recovery of ideal points with respect to 
fixed proposals 

Suppose the proposal locations are fixed. This could arise if they are exoge- 
nously determined, or during a step of a heuristic procedure for the recovery 
problem as in [21]. The recovery problem then reduces to n independent 
voter location problems. Conversely, if ideal points are fixed, the proposal 
location recovery problem reduces to m independent location problems. We 
note that proposal location in the dual or polar space is identical to voter 
location in the usual space. Thus all results in this subsection apply to 
both problems. 

Theorem 2: The simple recovery problem with fixed proposals is NP- 

complete in arbitrary dimension, and polynomially solvable in any fixed 
dimension. The corresponding proposal location recovery problem with fixed 
ideal points has the same complexity. 

Proof: Each proposal location can be taken as a hyperplane in iR d with 
orientation (indicating the “yes” halfspace). If a voter is placed in the 
“wrong” halfspace, an “error” results. The recovery problem therefore is 
equivalent to the following: given a collection of linear inequalities, find a 
location in SR 4 that minimizes the number of violated inequalities. This is 
NP-complete [ll, page 267]. 

In fixed dimension, a well known combinatorial formula (see [3, e.g.]) 
states that m hyperplanes partition dl d into distinct regions (or 
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fewer if they are not in general position). Thus the voter must be placed 
in one of 0(m d ) possible regions. Obviously the error count is the same 
within any region. The region with the least error count is the optimal 
solution, and can be found in polynomial time. □ 

The Poole-Rosenthal recovery heuristic selects the coordinates of the 
locations one dimension at a time. For each dimension, the heuristic al- 
ternates between fixing the ideal points and fixing the proposal locations, 
until a locally optimal solution is reached (the method ceases to improve 
the solution) [22]. When the coordinates for a dimension have been se- 
lected in this way, the heuristic seeks a locally optimal solution for the next 
dimension, and so on until all are selected [21,23]. When the simple error 
count is to be minimized, or the random factor is small, Theorem 2 suggests 
an alternate version of this heuristic, in which all dimensions are consid- 
ered simultaneously. The more powerful neighborhood structure provided 
by Theorem 2 suggests that the alternate version would perform better. 
Of course, given Theorem 1, it is unlikely that the alternate version will 
always dominate the other in numerical performance. To be sure of domi- 
nance, one could apply the alternate version to the solution generated by 
the original heuristic: this hybrid heuristic would be guaranteed always to 
perform at least as well as the original heuristic. 

Theorem 2 hold promise for weighted recovery problems, as well. In the 
unweighted recovery problem, the penalty is constant within each polyhe- 
dral region, so one simply has to pick the best region. In the more general 
weighted recovery problem, the penalty will vary within each region. If the 
penalty function behaves tractably, on one side of the proposal hyperplane 
(e.g. it is convex), then one simply has to solve a convex minimization prob- 
lem for each region (the sum of convex functions is convex). This would be 
computationally feasible in two or three dimensions. 

Unfortunately, the most widely used penalty functions are not convex. 
Rather, they tend to flatten out far away from the proposal hyperplane. 
That is, the penalty for being on the wrong side of the hyperplane is nearly 
the same at distance 100 or 101; similarly it isn’t much better to be on the 
right side at distance 101 than to be on the right side at distance 100. But 
there is a big difference between being distance 0 (i.e. on the hyperplane) 
and distance 1. 

For these penalty functions we modify the construction of regions given 
in the proof of Theorem 2. See Figure 2.2: the penalty function is convex on 
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Figure 1: Piece-wise linear approximation of non-convex penalty function 

the correct side of the hyperplane, but not on the wrong side. Approximate 
the penalty on the wrong side by a piecewise linear function. (Note that 
the piecewise function is concave.) For simplicity suppose there is only one 
breakpoint. Now place a hyperplane parallel to the proposal hyperplane, at 
distance corresponding to the breakpoint. If this is done for each proposal, 
there will be 2m hyperplanes in $i d , and still only 0(m d ) regions. (The 
upper bound will increase by roughly 2 d , a modest amount for d = 2 or 
3. Actually for d = 2 the number of regions increases by a little less than 
a factor of 3.) Within each of these regions, the penalty function will be 
a sum of linear and convex functions. Now one simply solves the convex 
program for each region. If linear programs are desired, approximate the 
penalty function on the correct side by a piecewise linear function as well. 
If a better fit is desired, one can use piecewise convex instead of piecewise 
linear functions, or one can increase the number of breakpoints. 



3 Computational methods for Simpson-Kramer 
win sets 

In this and the next two sections, we consider algorithmic procedures for 
computing solution concepts and associated values. The first solution con- 
cept we address involves supermajority or a-majority voting. 

Fix the voter ideal points V. For any 1/2 < a < 1 let PF(a) denote 
the win set with respect to supermajority rule at level a. Let a'(a:) de- 
note the smallest supermajority level at which x is undominated (in the 
core), inf Q {ct : x E IF(a)}. Let ct^F) denote min r a*(x), the smallest level 
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at which the core is nonempty. Thus W(o:*(K)) is the Simpson-Kramer 
minimax point. These supermajority win sets have very interesting and 
powerful properties, see [17,12,5,4]. In this section we develop computa- 
tional methods for these concepts. 

There are several related algorithmic problems: 

1. Membership: given V, x, and a, is x € W(a)? (Is x undominated 
with respect to a-majority rule?) 

2. Critical Level: given V , x, find a*(x). (What is the smallest super- 
majority level a at which x is undominated?) 

3. Win Set: given V, a , find the set W(o:) or determine that it is empty. 
That is, find some “good” representation of the core for the given 
value of a. 

4. Minimax Level & Point: given V, find a*(K) and IV(a*(V)). (Find 
the Simpson-Kramer minimax point and its associated a level.) 

As stated in the introduction, all these problems are NP-hard in arbi- 
trary dimension (this follows from the co-NP-completeness of determining 
if x € W (1/2) [1,14]). Fortunately, all these problems may be solved fairly 
easily in low dimension. 

Algorithms for (l)-(4) 

(1): Membership. The point x is undominated with respect to a- 

majority rule iff no open halfspace defined by a hyperplane through x con- 
tains more than a|V| ideal points. Therefore we can determine membership 
by finding the “densest” open halfspace, the open halfspace containing the 
most points of V , and counting the number of points it contains. John- 
son and Preparata [14] provide polynomial algorithms to find the “densest” 
closed and open halfspaces with respect to a set of points in fixed dimension. 
This resolves membership in fixed dimension. 

One drawback to the algorithm of [ 14] is that it is moderately compli- 
cated to implement. Here we suggest an alternate algorithm that is con- 
ceptually simple, and has the same order of speed on nondegenerate cases. 
The algorithm should be easy to implement, particularly as it requires com- 
putations similar to those for the yolk. The principal disadvantage to our 
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algorithm is its relatively poor performance on degenerate configurations, 
although it is still “feist” (polynomial time) for any fixed dimension. In four 
or more dimensions, the algorithm of [14] would be much faster on highly 
degenerate cases. 

Construct all the “extremal” or “limiting” hyperplanes through x, and 
count the number of ideal points in each halfspace. (A limiting hyperplane 
passes through d affinedly independent ideal points in 9? d . The point x is 
counted as an ideal point, whether or not x E V .) This finds the densest 
closed halfspace, with the same time complexity as [14]. The search for a 
densest closed halfspace be restricted to just the “limiting” hyperplanes, 
even for degenerate configurations, because any hyperplane through x can 
be “tilted” so as to touch d ideal points, while keeping at least as many 
ideal points in each closed halfspace defined by the hyperplane '25, Lemma 
5]. Let h denote the hyperplane defining this halfspace, and let V\ denote 
the points of V on h, excluding x. If the configuration is nondegenerate, it 
is possible to perturb h to a new hyperplane h, which passes through x, and 
leaves all the points on its “dense” side. This perturbation is possible 
because |F h | = d — 1 when the configuration is nondegenerate. 

Note that the fundamental operations involved are: (i) finding the lim- 
iting hyperplanes, and (ii) counting the number of points in the halfspaces 
so defined. These operations comprise part of the fundamental operations 
in yolk computations, which should be an advantage to implementors. (In 
addition, one must count the number of points on h to verify |V),| = d — 1, 
but this is trivial given (ii). In two (or more) dimensions, the algorithm can 
be sped up by a factor of n/ log n by the preprocessing of V [7], just as with 
yolk computations [31]. The algorithm requires time 0{n d ~ l log n) in the 
nondegenerate case (or for the general case, if a densest closed halfspace is 
desired), the same as the algorithm of [14] . 

It remains to modify the algorithm in the case of degeneracy. If config- 
urations are degenerate, it may not be possible to perturb h to get all the 
points of V k on the same side. (Indeed the densest open halfspace might 
not be a perturbation of the densest closed halfspace. However, the densest 
open halfspace is defined by a perturbation of some limiting hyperplane.) 
Fortunately, the problem of finding the best perturbation of h, that which 
gets as many points as possible on the dense side, is precisely the densest 
hemisphere problem in a lower dimensional space. This simplifies imple- 
mentation considerably. 
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Figure 2: Finding h in 2 dimensions 

To be precise, we must find all limiting hyperplanes h, and for each 
h find its best perturbation h. If |Vj,| = d — 1 this requires no additional 
computation. If there are more points on h, we recursively solve the densest 
open hemisphere problem at x for the point set V h , in the d — 1 dimensional 
subspace h. The base case of the recursion is two dimensions: when d = 2, 
(see Figure 3) h is a line through x passing through 2 or more (other) points 
of V . This line is divided into two half lines emanating from x. Select the 
half line with the greater number of points. □ 

Both algorithms extend immediately to the case of weighted voters. 

(2) : Critical Level. This could be accomplished with binary search on a 
using (l). However, examination of either algorithm for (l) shows that they 
actually compute a*(x) directly, since they find the densest open halfspace. 

□ 

(3) : Win Set. Find all the “limiting” hyperplanes h (those passing 
through d ideal points). Form a list E of all closed halfspaces defined by 
these h. Extract a list L of closed halfspaces h + containing cm + 1 or 
more ideal points. If a point is to be in W(a) it must be in each of these 
closed halfspaces, for otherwise some other point could get an + 1 or more 
supporters to defeat it. Therefore the win set is the set of feasible solutions 
to the linear program that constrains points to be in each closed halfspace 
on the list L. □ 

As a corollary, we find that the win set is a polyhedral set. (This was 
established for the special case a = a’ by Kramer [17]). 

(4) : Minimax Level. This can be found, in principle, by binary search 
with (3). A direct implementation would proceed as follows: sort the list 



12 



E from (3) (there are 0(71^) in this list) in order from largest to smallest 
number of ideal points contained (descending order of \V fl h + |). Perform 
binary search on the cutoff point of E. Each “query” for the binary search 
solves a feasibility linear program with 0(n d ) constraints in d dimensions. 
(For computational purposes, the dual would probably be faster to solve, 
since there are far more constraints than variables in the primal.) □ 

If preferences are linear rather than Euclidean (see '4’) all these prob- 
lems reduce to (2), Critical level. This is because we need to know the 
densest halfspace, the open halfspace containing as many preference gra- 
dients as possible. As already observed, the densest halfspace problem is 
solved in [14; or may be solved with the alternative algorithm described 
above for (l). 

4 Computational methods for the yolk 

The yolk is another important solution concept, motivated by experimental 
data and possessed of beautiful theoretical properties [10,18]. As with the 
Simpson-Kramer minimax point, it is NP-hard to determine the yolk in 
arbitrary dimension (any solution concept which coincides with the classical 
core, when the core is nonempty, will suffer from this complexity because 
it is co-NP-complete to determine if the classical core is nonempty 111 ). 

For two dimensions, the computational situation is much rosier. Al- 
though the extremal median hyperplanes employed in Section 3 are not 
enough to determine the yolk [27], there is a polynomial algorithm to com- 
pute the yolk radius and center, in fixed dimension [31]. In two dimen- 
sions, this algorithm requires provably 0(n 45 ) time, and has even better 
time complexity o(n 3+f ) Ve > 0 (i.e. nearly cubic) if a conjecture of Erdos, 
Lovasz, et al. is true. 

We should also remark that the linear program given in [18] is easily 
computed and should make an excellent heuristic. It provides a rigorous 
lower bound on the yolk radius, and when its solution is exact this event 
can be determined in time 0(n 2 ) [31]. 

Koehler has proposed extending the yolk to n even. The algorithm 
cited above also works in this case, with the same time complexity. The 
algorithm also works for “supermajority rule” yolks (see [16,28]). 
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For the 3-dimensional case, the algorithm of [31] has time complexity 
C>(n 10 ) and would not be practical for more than a handful of points. This 
case needs more development but appears to be within reach through either 
effective heuristics or an improved optimizing procedure, perhaps making 
strong use of the convexity of the yolk radius function. 

5 Computational methods for other solution 
concepts 

The uncovered set 

The uncovered set is another solution concept with powerful theoretical 
properties. It is defined as follows: alternative x covers alternative y iff x 
defeats (is majority preferred to) y, and for all z which y defeats, z defeats 
z. An alternative z is uncovered iff there is no other alternative that covers 
z [20,19]. 

The computational picture with respect to the uncovered set is bleak at 
present. Hartley and Kilgour [13] show how to find the uncovered set in two 
dimensions when n = 3, and their method is not simple. To the author’s 
knowledge, no efficient general procedure is available even for two dimen- 
sions, although no forbidding complexity levels (e.g. NP-completeness) 
have been established for the fixed dimensional case, either. This remains 
a challenging open problem. 

To get an idea of why the problem is not simple, consider the presumably 
easier question of determining whether a given alternative z covers another 
given alternative y, in two dimensions. For any point z € 9? 2 and any 
0 < 9 < 2tc, let f(z,d) > 0 denote the distance from z along the ray at 
angle 9 to the first point that z defeats. (Technically / is defined as an 
infimum to allow / = 0). It is obvious that z will defeat all points on 
the ray past this first point. It is fairly straightforward to see that for 
any z, f(z,9) takes the form of a piecewise trigonometric function with 
polynomially many breakpoints. Then the question of whether z covers y 
is essentially the question of whether the exterior region in the plane of one 
such function about z contains the exterior region of another such function 
about y. This could be checked in polynomial time, because each pair of 
the trigonometric functions will intersect a small number of times, but the 
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procedure would be tedious to say the least. 

Therefore, to determine whether x covers y is polynomially solvable, 
but seems near the limit of computational practicality. Fast methods for 
determining the uncovered set in 2-D may be difficult to attain. On the 
brighter side, McKelvey [18] provides bounds on the uncovered set in terms 
of the yolk, which can be computed by the algorithm cited in Section 4. 

The epsilon-core 

Another solution concept of interest is the epsilon-core: a point x is 
considered undominated if there is no point y which is more than e closer 
to a majority of the voters. This concept was introduced by Shapley and 
Shubick [26]; promising empirical results are given in [24] ; some motivations 
and properties of the epsilon core for the spatial model are given in [30]. 

The epsilon-core presents the same four algorithmic problems as super- 
majority voting. These are, 

1. membership: given e, x is x in the c-core? 

2. critical level: for given x, find the least value of e for which x is 
undominated. 

3. win set: for given e, find the epsilon-core. 

4. minimum level: find the least value of e for which the epsilon-core is 
nonempty. 

Here we give here fast computational methods to solve the first two 
problems, membership and critical level, in 2-D. At present I do not know 
computationally efficient methods to solve the other problems, win set and 
minimum level. 

Algorithms for epsilon-core membership and critical level 

Membership: For given e, the point x is in the e-core if none of the hyperbolic 
regions, with one focus at x and parameter e, contain more than half the 
voter ideal points. These regions are all congruent and are found by rotating 
a hyperbolic curve around x. The number of points in the region inside the 
curve only changes when the curve passes over an ideal point. Therefore we 
can restrict our attention to the 2(n — l) = 0(n ) curves that intersect an 
ideal point. If none of these contain more than half the voter ideal points, 
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then x is undominated. The membership algorithm is therefore 0(n 2 ). (The 
same idea works in higher (fixed) dimensions, except one must be careful 
if the ideal points are sparse: when n is much larger than d the method 
extends easily.) □ 

Critical level: This is solved by binary search on e, using the membership 
algorithm as the query subprocedure. □ 

As an approximate method for win set and minimum level, the critical 
level algorithm could be invoked for each point of a grid. Decreasing the 
spacing of the grid would give as fine an approximation as desired. 



6 Convergence and asymptotic consistency 
of sample estimators 

In this section we survey convergence results for several solution concepts, 
as motivated in the introduction. 



6.1 Simpson-Kramer win sets 

There are several related convergence questions here, just as there are sev- 
eral related algorithmic questions in Section 3. Corresponding to “Member- 
ship” we could define a 0-1 valued random variable which indicates whether 
x is an a-majority winner or not. Unfortunately it is not the case that the 
sample statistic would be consistent. In fact we could have the sample 
statistic converge to 0 a.s., while the distribution value equals 1 (see [29]). 
The other convergence questions have nicer answers. 

• Minimax value: Does a‘(/r n ) — ► a*(^)? 

• Minimax set: Does S a .^ n )(p n ) -*• S a . ( ^(p)? 

• Critical level: Does a(x, p n ) — *■ a(x,p)l 

• Win set: Does S(a,p n ) — > S(a,p) in some suitable topology? 

Demange [6] considers a sequence of probability measures p n converging to 
p and shows convergence of Minimax value and Minimax set. The proof ap- 
pears to assume continuity of p n , and may not apply to the case considered 
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here, where is a discrete (empirical) measure. Caplin and Nalebuf [5] 
establish a.s. convergence of Minimax value under stated assumptions that 
[x is concave and has compact support. As they comment, however, their 
proof applies equally well to the more general case of continuous densities 
[i with compact support. Moreover, their proof also shows convergence of 
Critical level. Convergence of Minimax value for arbitrary is proved in 
[29], 

Minimax set and Win set are a little different and appear to require 
stronger assumptions on p. These questions ask if the location of the so- 
lution set converges, rather than if the necessary parameter level for the 
nonemptiness of the solution set converges. In [29], convergence of Minimax 
set is proved, under the assumption that p. is continuous and has unique 
minimax point. Convergence of the Win set will follow from arguments 
given here, under the assumption that fi is continuous and has compact 
support (see Section 6.3). 

6.2 The yolk 

There are two related convergence questions for the yolk, pertaining to 
its size and location. Let r(p) and c(p) denote the radius and center, 
respectively, of the yolk with respect to a distribution fx. Convergence 
of r(p„) to r(p) (ui.p.l) is proved in [30], assuming p is continuous and 
strictly positive in its region of support. If no restrictions are placed on p, 
the sample radius r(p n ) may not have a limit. I conjecture, moreover, that 
in sufficiently high dimension, it can occur that limsup n r(p n ) < r (^)- 
Here w r e prove the convergence of the yolk center. 

Theorem 4: If p is positive continuous on and has compact support 
then lim n _ 0o c(p n ) — «• c(p) a.s. 

Proof: We will need two lemmata. 

Lemma 1: Let r(s,p) denote the radius of the 2 -centered yolk. Then 
lim n ^, 00 r(z, p n ) — + r(z,p) a.s. uniformly over all z. 

(This is proved in [30].) 

Lemma 2: The function r( 2 ,p) is continuous and convex in z. 

(The proof is easy and found in [31]). 

Let c denote c(p), the center of the distributional yolk. Then r(c,p„) — ♦ 
r(fx) a.s. by Lemma 1. Now let e > 0 be arbitrary. We will show that for 
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all 2 6 $l d more than distance e from c, eventually z £ c(p , n ) w.p. 1. This 
will imply that c(p n ) must almost surely converge to points within c. 

Let D denote the set of points in the support of p, at distance e or more 
from c. D is compact so by Lemma 2, r(z,p) attains a minimum value 
td when z ranges over D. By assumption D does not intersect c. Hence 
r D > r{n). 

Let 6 — r D — r(p) > 0. By Lemma 1, eventually as n — > oo 
sup |r(z,p n ) — r(z,p)| < <5/3 a.s. 

Z 

So eventually for all z G D and all points p in c we have, a.s., r(z,p„) > 
r D — <5/3 > r D — 26/3 > r(p,p„). Hence eventually z c(p„) a.s. since 
there is another point p with smaller p-centered radius. □ 

Lemma 3 in [31] gives a related result: if V is any finite configuration 
of ideal points, and V is an arbitrarily slight nondegenerate perturbation 
of V , then the yolk radius and center of V can be made arbitrarily close to 
the yolk radius and center of V . In the case |H| even, the center of V may 
not be unique. In this case, the yolk center of V can be made arbitrarily 
close to some (but not any) yolk center of V. This result, too, guarantees 
a certain robustness of the yolk solution concept. 

6.3 The epsilon-core 

In this subsection w r e show convergence properties related to the epsilon- 
core. This solution concept and the a-majority concept have similar prop- 
erties. 

Let e(x,p) denote the smallest value of e that makes x undominated; let 
e' (p) denote the smallest value of e for which the epsilon-core is nonempty; 
let EC(e,n) denote the epsilon-core at value e; let EC’(p) denote the 
epsilon-core at value e’(p). 

Theorem 5: If p is continuous on and has compact support then with 
probability 1, 

1. c(x,p n ) * e(x, p). 

2. e*(p„) e-(p). 

3. Ye > 0, £’C(e,p n ) — *• EC(e, p). In particular, if e*(p) > 0, then 
EC'{p n ) -> EC'(p). 
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4. Vet, S(a,fi„) -+ S(a,n). 

Proof: The key property we rely on is that the probability measure 
of the region defined by a hyperbolic hypersurface changes continuously as 
the surface is translated continuously through SR 1 *. This follows immediately 
from the assumptions. Now apply the uniform convergence of the empirical 
measure ix n to ^ over all these hyperboloid regions ([30, Lemma 2\) to get 
convergence of the critical level e(x,/i) and the minimum level e*. 

To prove convergence of the win sets EC(e), we use the same argument 
as for the convergence of the yolk center in the previous subsection. That 
is, once we know critical level and minimum level converge, the continuity 
of the probability measure, and the compactness of its support, imply that 
for all 7 ? > 0 eventually all points more than t] from the win set EC(e,fx) 
will have critical level greater than t. The same proof works for convergence 
of S(a,/J, n ), the minimax win set, as well. □ 

7 Computations for large data sets and di- 
rections for further research 

7.1 Computing solution sets for very large popula- 
tions 

If a population is very large, even the relatively fast algorithms of section 
2 may be too slow for practical purposes. For these situations 2 we may 
combine the two categories of results in this article, to get fast, accurate 
computations. Suppose two random samples of n points are drawn from a 
distribution /j, leading to empirical measures f. i n and n' n . Suppose further 
we are interested in f(n), where the a.s. asymptotic consistency of the 
sample estimates /(/r n ) is established. Then since both and f{n' n 

converge to f(n) a.s., it must be that /im n _ 0O /(/^ n ) = /im n _ 0o /(^' n ) a.s. 

Now suppose we have a data for a large population P. The data may 
be thought of as a very large sample from some continuous distribution ji. 
By the asymptotic equivalence between sampling with and without replace- 
ment, we can approximate a sample from /i by a sample from P . Think of 

2 this idea is due to Richard McKelvey 
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P as mu n . To estimate f{P), we draw samples from P, and compute 
f(n' m ). The equality of limits in the preceding paragraph assures us that 
(for large enough m) the sample value f{n' m ) * s a g°°d estimate of the 
value of = f{P)- If n is very large, the required sample size m would 

be much smaller than n, rendering the problem computationally tractable. 
(Note, if P has multiplicities, this reasoning fails. I think the method would 
still work well, but have no mathematical justification, except in the case 
of the yolk, where Lemma 3 in [31] would fill in the gap.) 

7.2 Directions 

This paper has assembled a number of computational and convergence re- 
sults in support of empirical research with the spatial model. As indicated 
in the text, several computational problems remain open. These include 
finding good methods for the uncovered set and epsilon-core win sets, and 
also for the yolk in 3 or more dimensions. 

Regarding convergence results, finding rates of convergence appears to 
be an important open research area. If these could be estimated, and the 
role of the variance of n clarified, it should then be possible to construct 
statistical tests (e.g. confidence intervals) for empirically computed values 
such as the yolk radius and location, or the minimax level. 



References 

1 John J. Bartholdi, N. Narasimhan, and Craig A. Tovey. Recognizing 
majority-rule equilibrium in spatial voting games. Social Choice and 
Welfare , 1991. to appear. 

[2] John J. Bartholdi, Craig A. Tovey, and Michael A. Trick. Voting 
schemes for which it can be difficult to tell who won the election. 
Social Choice and Welfare, 6:157-165, 1989. 

[3] R.C. Buck. Partition of space. Americam Mathematical Monthly, 
50:541-544, 1943. 

[4] Andrew Caplin and Barry Nalebuff. Aggregation and social choice: a 
mean voter theorem. Econometrica, 59:1-24, 1991. 



20 



[5] Andrew Caplin and Barry Nalebuff. On 64% majority rule. Econo- 
metrica, 56, 1988. 

[6] Gabrielle Demange. A limit theorem on the minmax set. Journal of 
Mathematical Economics, 9:145-164, 1982. 

[7] H. Edelsbrunner, D. G. Kirkpatrick, and H. A. Maurer. Polygonal 
intersection searching. Information Processing Letters, 14:74-79, 1982. 

[8] James Enelow and Melvin Hinich. A general probabilistic spatial the- 
ory of elections. Public Choice, 61:101-113, 1989. 

[9] James Enelow and Melvin Hinich. The Spatial Theory of Voting. Cam- 
bridge University Press, Cambridge, 1984. 

[10] John A. Ferejohn, Richard McKelvey, and Edward Packel. Limiting 
distributions for continuous state markov models. Social Choice and 
Welfare, 1:45-67, 1984. 

1 1 ] Michael Garey and David Johnson. Computers and Intractability: A 
Guide to the Theory of N P -Completeness. W.H. Freeman, San Fran- 
cisco, 1979. 

[12] Joseph Greenberg. Consistent majority rules over compact sets of 
alternatives. Econometrica , 47:627-636, 1979. 

[13] Richard Hartley and Marc Kilgour. The geometry of the uncovered 
set in the three-voter spatial model. Mathematical Social Sciences, 
14:175-183, 1987. 

[14] David S. Johnson and F.P. Preparata. The densest hemisphere prob- 
lem. Theor. Comput. Sci., 6:93-107, 1978. 

[ 15] Donald E. Knuth. The Art of Computer Programming: Fundamen- 
tal Algorithms. Addison- Wesley Publishing Company, Reading, Mas- 
sachusetts, 1973. 

[16; David H. Koehler. The size of the yolk: computations for odd and 
even-numbered committees. Social Choice and Welfare , 7:231-245, 
1990. 



21 



[17] Gerald H. Kramer. A dynamical model of political equilibrium. Jour- 
nal of Economic Theory, 16:310-334, 1977. 

18] Richard D. McKelvey. Covering, dominance, and institution free prop- 
erties of social choice. American Journal of Political Science, 30:283- 
314, 1986. 

f 19] N.R. Miller. The covering relation in tournaments: two corrections. 
American Journal of Political Science, 27:382-385, 1983. 

[20] N.R. Miller. A new solution set for tournaments and majority voting. 
American Journal of Political Science, 24:68-96, 1980. 

21] Keith Poole and Howard Rosenthal. Patterns of congressional voting. 
American Journal of Political Science, 1990. to appear. 

22] Keith Poole and Howard Rosenthal. A spatial model for legislative roll 
call analysis. American Journal of Political Science, 29:357-384, 1985. 

23] Keith Poole and Howard Rosenthal. Spatial Realiignment and the Map- 
ping of Issues in American History: the Evidence from Roll Call Vot- 
ing. Technical Report GSIA WP 1990-41, Carnegie-Mellon University, 
Pittsburgh, PA 15213, 1990. 

[24] Stephen W. Salant and Eban Goodstein. Predicting committee behav- 
ior in majority rule voting experiments. RAND Journal of Economics, 
21:293-313, 1990. 

25] Norman J. Schofield and Craig A. Tovey. Probability and convergence 
of supermajority cores with euclidean preferences. Mathematical and 
Computer Modelling, 1991. to appear. 

[26] Lloyd Shapley and Martin Shubick. Quasi-cores in a monetary econ- 
omy with non-convex preferences. Econometrica, 34:805-827, 1966. 

27] Richard E. Stone and Craig A. Tovey. Limiting median lines do not 
suffice to determine the yolk. Social Choice and Welfare, 1990. to 
appear. 



22 



[28] Craig A Tovey. The almost surely shrinking yolk. Technical Report 150, 
Center in Political Economy, Washington University, St. Louis, Mo, 
October 1990. submitted to Mathematics of Operations Research. 

[29] Craig A Tovey. A Critique of Distributional Analysis. Technical Re- 
port NPSOR-91-16, Department of Operations Research, Naval Post- 
graduate School, Monterey, Ca 93943, January 1991. submitted to 

Econometrica. 

f 30] Craig A Tovey. The Instability of Instability. Technical Re- 
port NPSOR-91-15, Department of Operations Research, Naval Post- 
graduate School, Monterey, Ca 93943, May 1991. 

r 3l] Craig A. Tovey. A polynomial algorithm to compute the yolk in fixed 
dimension. April 1991. Presented at SIAM Symposium on Complexity 
Issues in Numerical Optimization, March 1991, Ithaca, N.Y. Submit- 
ted to Mathematical Programming. 



23 



INITIAL DISTRIBUTION LIST 



1. Library (Code 52) 2 

Naval Postgraduate School 

Monterey, CA 93943-5000 

2. Defense Technical Information Center 2 

Cameron Station 

Alexandria, VA 22314 

3. Office of Research Administration (Code 08) 1 

Naval Postgraduate School 

Monterey, CA 93943-5000 

4. Prof. Peter Purdue 1 

Code OR-Pd 

Naval Postgraduate School 
Monterey, CA 93943-5000 

5. Department of Operations Research (Code 55) 1 

Naval Postgraduate School 

Monterey, CA 93943-5000 

6. Prof. Craig A. Tovey, Code OR-Tv 30 

Naval Postgraduate School 

Monterey, CA 93943-5000 

7. Prof. Lyn R. Whitaker, Code OR-Wh 1 

Naval Postgraduate School 

Monterey, CA 93943-5000 

8. Prof. Donald P. Gaver, Code OR-Gv 1 

Naval Postgraduate School 

Monterey, CA 93943-5000 

9. Prof. Patricia A. Jacobs, Code OR-Jc 1 

Naval Postgraduate School 

Monterey, CA 93943-5000 

10. Prof. Siriphong Lawphongpanich, Code OR-Lp 1 

Naval Postgraduate School 

Monterey, CA 93943-5000 



11. Prof. William Kemple, Code OR-Ke 1 

Naval Postgraduate School 

Monterey, CA 93943-5000 

12. Prof. P. A. W. Lewis, Code OR-Lw 1 

Naval Postgraduate School 

Monterey, CA 93943-5000 

13. Prof. Marcelo C. Bartroli, Code OR-Bt 1 

Naval Postgraduate School 

Monterey, CA 93943-5000 

14. Prof. Michael Bailey, Code OR-Ba 1 

Naval Postgraduate School 

Monterey, CA 93943-5000 

15. Prof. Eric S. Theise, Code OR-Th 1 

Naval Postgraduate School 

Monterey, CA 93943-5000 

16. Joseph Greenberg 1 

Department of Economics 

McGill University 

1001 Sherbooke West 

Montreal PI CANADA H3A 1G5 

17. Melvin Hinich 1 

Dept, of Government 

The University of Texas at Austin 
Austin, TX 78712 

18. Howard Rosenthal 1 

GSIA 

Carnegie Mellon University 
Pittsburgh, PA 15213 

19. Stan Zin 1 

GSIA 

Carnegie Mellon University 
Pittsburgh, PA 15213 



1 



20. Victor Rios-Rull 

GSIA 

Carnegie Mellon University 
Pittsburgh, PA 15213 

21. Peter Hammer 1 

RUTCOR 

Rutgers University 
New Brunswick, NJ 08903 



22. Michael Rothkopf 1 

RUTCOR 

Rutgers University 
New Brunswick, NJ 08903 

23. Steven Shavell 1 

Harvard Law School 

Harvard University 
Cambridge, MA 02138 

24. A. Alkan 1 

Bogazi^i University 



Faculty of Economic and Administrative Sciences 
Dept, of Management 
80815 Bebek— ISTANBUL 
Turquie 

25. J. Abdou 1 

U.F.R. de Mathematiques 

Universite de Paris I 
90 rue de Tolbiac 
75634 PARIS Cedex 13 

26. J. P. Barthelemy 1 

E.N.S.T. 

Dept. Informatique 
46 rue Barrault 
75634 PARIS Cedex 



1 



27. S. Barbera 

Dept. d'Economica I Historia Economica 
Facultat de Ciences Economiques I Empresariales 
Universitat Autonoma de Barcelona 
08193 Bellaterra— BARCELONA 
Espagne 

28. G. Bordes 1 

L.A.R.E. 

Faculte des Sciences Economiques 
Universite de Bordeaux I 
33604 PESSAC 

29. D. Campbell .1 

Dept, of Economics 

University of Toronto 
150 St. George Street 
TORONTO, CANADA M5S IAI 

30. A. Guenoche 1 

G.R.T.C. — C.N.R.S. 

31 Chemin Joseph Aiguier 
13402 MARSEILLE Cedex 09 

31. L. Gevers 1 

Facultes Universitaires Notre Dame 

de la Paix 

8 Rempart de la Vierge 
5000 NAMUR 
Belgique 



32. B. Monjardet 1 

CAMS-EHESS 

54 Bd Raspail 
75006 PARIS 

33. J. Moulen 1 



Ecole Normale Superieure 

Universite de Yaounde 

BP 47 

Yaounde 

Cameroun 



1 



34. B. Peleg 

Dept, of Mathematics 
The Hebrew University of Jerusalem 
Givat Ram, 91904 JERUSALEM 
Israel 

35. M. Salles 1 

C.R.E.M.E.R.C. 

Universite de Caen 
14032 CAEN Cedex 

36. Eban Goodstein 1 

Skidmore College 

Dept, of Economics 

Saratoga Springs, NY 12866-1632 



37. David Koehler 1 

School for Public Affairs 

The American University 
Washington DC 20016 

38. Stephen Salant 1 

Dept, of Economics 

University of Michigan 
Ann Arbor, MI 48109 

39. Al Roth 1 

Dept, of Economics 

University of Pittsburgh 
Pittsburgh, PA 15260 

40. David Gale 1 

U.C. Berkeley 

Dept, of Operations Research 
Berkeley, CA 94720 

41. John J. Bartholdi 1 

ISYE 



Georgia Institute of Technology 
Atlanta, GA 30332 



42. Gideon Weiss 1 

ISYE 

Georgia Institute of Technology 
Atlanta, GA 30332 

43. Robert Foley 1 

ISYE 

Georgia Institute of Technology 
Atlanta, GA 30332 

44. Loren Platzman 1 

ISYE 

Georgia Institute of Technology 
Atlanta, GA 30332 

45. Kenneth Arrow 1 

Department of Economics, 

Stanford University, 

Stanford, CA 94305 

46. John Gilmour 1 

Center for Political Economoy 

Washington University 
St. Louis, MO 63130 

47. Norman Schofield 1 

Center for Political Economoy 

Washington University 
St. Louis, MO 63130 

48. Richard McKelvey 1 

Division of Humanities and Social Sciences, 

California Institue of Technology, 

Pasadena, CA 91125 

49. Andrew Caplin 1 

Department of Economics 

Columbia University, 

116th and Amsterdam Ave. 

New York, NY 10027 



50. Richard Stone 1 

Room H03K328 

AT&T Bell Laboratories 
Holmdel, NJ 07733-1988 

51. Ariel Rubenstein 1 

Dept, of Economics 

Tel Aviv University 
Rumat Aviv 69978 
Tel Aviv, ISRAEL 



52. Abraham Neyman 1 

Depart, of Applied Mathematics 

SUNY at Stony Brook 
Stony Brook NY 11794 

53. Scott Feld 1 

Depart, of Applied Mathematics 

SUNY at Stony Brook 
Stony Brook NY 11794 

54. Gordon Tullock 1 

Dept, of Economics 

University of Arizona 
Tucson, AZ 85721 

55. Barry Nalebuff 1 



Yale School of Organization and Management 
Box IA, 

New Haven, CT 06520 

56. Mahmoud El-Gamal 1 

Division of Humanities and Social Sciences, 

California Institue of Technology, 

Pasadena, C A 91125 

57. Michael A. Trick 1 

GSIA 

Carnegie Mellon University 
Pittsburgh, PA 15213 



1 



58. Charles Plott 

Division of Humanities and Social Sciences, 

California Institue of Technology, 

Pasadena, CA 91125 

59. Peter Ordeshook 1 

Division of Humanities and Social Sciences, 

California Institue of Technology, 

Pasadena, CA 91125 



60. Herve Moulin 1 

Duke University, Dept, of Economics 

Durham, NC 27706 

61. Bernard Grofman 1 

University of California at Irvine 

Irvine, CA 92717 

62. Michel Balinski 1 



Laboratoire de Econometrie 
Ecole Poly tech 
5 Rue Descartes 
75005 Paris FRANCE 



63. H. Peyton Young, 1 

School of Public Affairs 

University of Maryland 
College Park, MD 20742 

64. Herbert S. Wilf 1 

Department of Mathematics 

University of Pennsylvania 
Philadelphia, PA 19104-6395 

65. Prof. Man-Tak Shing 1 



Computer Science Department 
Naval Postgraduate School 
Monterey, CA 93943 



DUDLEY KNOX LIBRARY 




2768 00347309 1 



